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Abstract 

In the design and analysis of revenue-maximizing auctions, auction performance is typ¬ 
ically measured with respect to a prior distribution over inputs. The most obvious source 
for such a distribution is past data. The goal of this paper is to understand how much data 
is necessary and sufficient to guarantee near-optimal expected revenue. 

Our basic model is a single-item auction in which bidders’ valuations are drawn indepen¬ 
dently from unknown and non-identical distributions. The seller is given m samples from 
each of these distributions “for free” and chooses an auction to run on a fresh sample. How 
large does m need to be, as a function of the number k of bidders and e > 0, so that a 
(1 — e)-approximation of the optimal revenue is achievable? 

We prove that, under standard tail conditions on the underlying distributions, m = 
poly(/c, samples are necessary and sufficient. Our lower bound stands in contrast to 
many recent results on simple and prior-independent auctions and fundamentally involves 
the interplay between bidder competition, non-identical distributions, and a very close (but 
still constant) approximation of the optimal revenue. It effectively shows that the only way 
to achieve a sufficiently good constant approximation of the optimal revenue is through a 
detailed understanding of bidders’ valuation distributions. Our upper bound is constructive 
and applies in particular to a variant of the empirical Myerson auction, the natural auction 
that runs the revenue-maximizing auction with respect to the empirical distributions of the 
samples. 

Our sample complexity lower bound depends on the set of allowable distributions, and 
to capture this we introduce a-strongly regular distributions, which interpolate between the 
well-studied classes of regular {a = 0) and MHR {a = 1) distributions. We give evidence 
that this definition is of independent interest. 


1 Introduction 

Comparing the revenue of two different auctions requires an analysis framework for trading off 
performance on different inputs. For instance, in a single-item auction, a second-price auction 
with a reserve price r > 0 will earn more revenue than a second-price auction with no reserve 
price on some inputs, and less on others. Which auction is better? 

The conventional approach in auction theory is Bayesian, or average-case, analysis. That 
is, bidders’ valuations are assumed to be drawn from a distribution, and one auction is defined 
to be better than another if it has higher expected revenue with respect to this distribution. 
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The optimal auction is then the one with the highest expected revenue. The optimal auction 
depends on the assumed distribution, in some cases in a detailed way. 

While there is now a significant body of work on worst-case revenue maximization (see m), 
a majority of modern computer science research on revenue-maximizing auctions uses Bayesian 
analysis to measure auction performance (see [211 )• Since the comparison between auctions 
depends fundamentally on the assumed distribution, an obvious question is: where does this 
prior distribution eome from, anyway? 

In most applications, and especially in computer science contexts, the answer is equally 
obvious: from past data. For example, in Yahoo!’s keyword auctions, Bayesian analysis was 
used to provide guidance on how to set per-click reserve prices, and the valuation distributions 
used in this analysis are derived straightforwardly from bid data from the recent past [35] . This 
is a natural approach, but how well does it work? 

1.1 The Model 

The goal of this paper is to understand how much data is necessary and sufficient to guarantee 
near-optimal expected revenue. Our model is the following. There are k bidders in a single-item 
auction. The valuation (i.e., willingness-to-pay) of bidder i is a sample from a distribution 
The Ffs are independent but not necessarily identical. 

The distribution F = Fi x • • • x is unknown to the seller. The “data” comes in the form of 
m independent and identically distributed (i.i.d.) samples ..., from F — equivalently, 
m i.i.d. samples from each of the k individual distributions Fi,... ,Fj^. The seller observes the 
samples and then commits to a truthful auction We call this function from samples to 
auctions an m-sample auetion strategy. The seller then earns the revenue of its chosen auction 
A on the “real” input, a fresh independent sample from F. See also Figure We can 

state our main question as follows. 

(*) How many samples m are neeessary and suffieient for the existenee of an m-sample auetion 
strategy that, for every distribution F in some elass V, has expeeted revenue at least (1 —e) 
times that of the optimal auetion for F ? 

The expected revenue of an auction strategy is with respect to both the samples ..., 
and the input — i.e., over m -\- 1 i.i.d. samples from F. The expected revenue of an 

optimal auction is with respect to a single sample (the input) from F. Our formalism is inspired 
by computational learning theory [38] . 

The answer to the question (*) could be a function of up to three different parameters: 
the error tolerance e, the number k of bidders, and the set V of allowable distribution^ It is 
clear that some restriction on T> is necessary for the question (*) to be interesting: without any 
restriction, no finite number of samples is sufficient to guarantee near-optimal revenue, even 
when there is only one bidder]^ 

^An auction is truthful if truthful bidding is a dominant strategy for every bidder. That is: for every bidder i, 
and all possible bids by the other bidders, i maximizes its expected utility (value minus price paid) by bidding 
its true value. For single-item auctions, the optimal expected revenue of any (possibly non-truthful) auction, 
measured at a Bayes-Nash equilibrium with respect to the prior distribution, is no larger than that of the optimal 
truthful auction. Also, the restriction to dominant strategies is natural given our assumption of an unknown 
distribution. 

^As the distribution F is unknown, we seek uniform sample complexity bounds, meaning bounds that depend 
only on T> and not on F. 

^To see this, consider all distributions that take on a value with probability ^ and 0 with probability 
1 — The optimal auction for such a distribution earns expected revenue M. It is not difficult to prove that, 
for every m, there is no m-sample auction strategy with near-optimal revenue for every such distribution — 
for sufficiently large M, all m samples are 0 with high probability and the auction strategy has to resort to an 
uneducated guess for M. 
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Figure 1: A single-item auction A is chosen as a function of s i.i.d. samples vi,..., from an 
unknown distribution Fi x • • • x and applied to a fresh sample from the same distri¬ 
bution. The benchmark is the expected revenue of the Myerson-optimal auction for Fi,..., 


Our model can be viewed as an interpolation between worst-case and average-case analysis. 
It inherits much of the robustness of the worst-case model, since we demand guarantees for 
every underlying F, while allowing very good approximation guarantees with respect to a strong 
benchmark. 

1.2 Distributional Assumptions 

Two distributional assumptions that have been extensively used (see e.g. 121) are the regularity 
and monotone hazard rate (MHR) conditions. The former asserts that the “virtual valuation” 
function Vi — ^ is nondecreasing, where fi is the density of F^, while the second imposes 

the strictly stronger condition that is nondecreasing. The “most tail-heavy” regular 

distribution has the distribution function Fi{vi) = 1 — while the most tail-heavy MHR 

distributions are the exponential distributions. 

Our lower bound on the sample complexity of revenue maximization depends on the set 
of allowable distributions, and to capture this we introduce a parameterized condition that 
interpolates between the regularity and MHR conditions; this condition is also useful in other 
contexts (see Section Q. 

Definition 1.1. (ct-Strongly Regular Distribution) Let F he a distribution with positive 
density funetion f on its support [a, b], where 0 < a < 00 and a < b < 00 . Let (p{v) — v — ^ 
denote the eorresponding virtual valuation funetion. F is cr-strongly regular if 

(p{y) - (p{x) > a{y - x) (1) 


whenever y > x >{). 

For distributions with a differentiable virtual valuation function (/p, condition 0 is equiv¬ 
alent to ^ > a. Regular and MHR distributions are precisely the 0- and 1-strongly regular 
distributions, respectively. A product distribution F = Fi x • • • x F^^ is called a-strongly regular 
if each Fi is cr-strongly regular. For the lower bound, we take the set V of allowable distributions 
in (*) to be the a-strongly regular distributions for a parameter a G [0,1]. 

1.3 Our Results 

Our main result is that m — poly(A;, y) samples are necessary and sufficient for the existence 
of an m-sample auction strategy that, for every strongly regular distribution F, has expected 
revenue at least (1 — e) times that of an optimal auction. 


3 







Both our upper and lower bounds on the sample complexity of revenue maximization are 
significant. For the lower bound, it is far from obvious that the number of samples per bidder 
needs to depend on k at all, let alone polynomially. Indeed, for many relaxations of the problem 
we study, the sample complexity is a function of e only. 

• If there is an unlimited supply of items (digital goods), then the problem reduces to sepa¬ 
rate single-bidder problems, for which poly(^) samples suffice for a (1 — e)-approximation 
for all regular distributions m Lemma 4.1]. 

• If bidder valuations are independent and identical draws from an unknown regular distri¬ 
bution, then poly(^) samples suffice for a (1 — e)-approximation [181 Theorem 4.3]. 

• If only a approximation of the optimal expected revenue is required, then only a single 
sample is required. This follows from a generalization of the Bulow-Klemperer theorem [7] 
to non-i.i.d. bidders m Theorem 4.4]. 

Thus, the necessary dependence on k fundamentally involves the interplay between bidder 
competition, non-identical distributions, and a very close (but still constant) approximation of 
the optimal revenue. 

On a conceptual level, our lower bound shows that designing c-approximate auctions for 
constants c sufficiently close to 1 is a qualitatively different problem than for more modest 
constants like For example, previous work has demonstrated that auctions with reasonably 
good approximation factors are possible with minimal dependence on the valuation distributions 
(e.g. BMW or even, when there is no bidder with a unique valuation distribution, with no 
dependence on the valuation distributions [nillHlESl. Another interpretation of some previous 
results, such as UDEl, is the existence of constant-factor approximate auctions that derive 
no benefit from bidder competition. Our lower bound identifies, for the first time, a constant 
approximation threshold beyond which “robustness” and “prior-independence” results of these 
types cannot extend. Our argument formalizes the idea that, with two or more non-identical 
bidders, the only way to achieve a sufficiently good constant approximation of the optimal 
revenue is through a detailed understanding of bidders’ valuation distributions and an essentially 
optimal resolution of bidder competition. 

We provide an upper bound on the number of samples needed for near-optimal approxima¬ 
tion by analyzing a natural auction strategy. Recall that for a distribution F that is known a 
priori, Myerson’s optimal auction gives the item to the bidder with the highest virtual valua¬ 
tion (pi{vi) — Vi — fo no one if all virtual valuations are negative [33]. The empirical 

Myerson auction is the obvious analog when one has data rather than distributional knowledge: 
define Fi as the empirical distribution of the samples from and run the optimal auction for 
F0 We prove that a variant on the empirical Myerson auction has expected revenue at least 
(1 — e) times optimal provided it is given a sufficiently large polynomial number of samples]^ 
A key aspect in our analysis is identifying the (non-pointwise) sense in which empirical virtual 
valuation functions approximate the actual virtual valuation functions; this is non-trivial even 
for the special case of MHR distributions. 

^Since the empirical distributions are generally not regular even when the underlying distributions F are, a 
standard extra “ironing” step is required; see Section for details. 

^Left unmodified, the empirical Myerson mechanism can be led astray by poor approximations at the upper 
end of the valuation distributions caused by a small sample effect. We prove that excluding the very highest 
samples from the empirical distributions addresses the problem. 
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1.4 Technical Approach 


The proofs of our upper and lower bounds are fairly technical, so we provide here an overview of 
the main ideas. We begin with the upper bound, which roughly consists of the following steps. 


1. (Lemma 6.2) For some fixed bidder with distribution F, consider the corresponding m 
samples vi > • • • > v^. Define the “empirical quantile” qj of Vj as the expected 

quantile of the jth order statistic. Taking a “net” of quantiles and applying standard 
large deviation bounds shows that all but the bottom ^ fraction of the empirical quantiles 
are good multiplicative approximations of their expectations with high probability; here 
^ > 0 is a key parameter that will depend on k and e. 


2. (Lemma 6.4) Recall that the expected revenue of an auction equals its expected virtual 
welfare [33]. Myerson’s optimal auction maximizes virtual welfare pointwise, whereas our 
auction maximizes (ironed) empirical virtual welfare pointwise. In a perfect world, we 
would be able to argue that the empirical virtual valuation functions are good pointwise 
approximations of the true virtual valuation functions, and hence the expected virtual 
welfare of our auction is close to that of Myerson’s auction. Unfortunately, good relative 
approximation of quantiles does not necessarily translate to good relative approximation 
of virtual valuations. The reason is that a virtual valuation function v — ^ can 

change arbitrarily rapidly in a region where the density changes rapidly (even for MHR 
distributions). 


We instead prove a different sense in which empirical virtual values approximate actual 
virtual values, working in the (quantile) domain as well as in the range of the virtual 
valuation functions. Recall that the quantile q{v) is defined as 1 — F{v). We show that for 
suitable Ai, A 2 > 0, for all but the top ^ fraction of quantiles in [0,1], with high probability 
the empirical virtual value ip{q) is sandwiched between (p{q{l + Ai)) and (f{q/{l + A 2 )), 
modulo small additive factors. (By ip{q) we mean — q)).) The additive factors 

are functions of 1/g, as well as of k and e, which complicates the analysis. 


3. (Lemmas |6.8| and |6.9| ) Consider a fixed bidder i. By the previous step, up to additive 
factors, we can lower bound the virtual welfare contributed by a bidder i with a quantile 
qi = I — Fi{vi) outside the top ^ fraction in the empirical Myerson mechanism by the 
virtual value contributed by i in the optimal auction when it has a quantile of g^(l + Ai). 
Or not quite: an additional issue is that the empirical virtual valuation of a different 
bidder j with quantile qj might be larger than its true virtual value, leading the empirical 
Myerson auction to allocate to j over the rightful winner i, and resulting in a reduction in 
the total virtual welfare being accumulated as compared to the actual Myerson auction. 
The difference in virtual values is bounded by the additive factors described in Q; as 
the outcomes are determined in the empirical auction, we will need two sets of factors: 
for bidder i, the factors from the lower sandwiching bound, and for bidder j, the factors 
from the upper sandwiching bound. We will show that there is only a small probability of 
large additive factors, which suffices to bound the expected reduction in revenue when the 
additive factors are large. When the additive factors are small, their contribution to the 
reduction in revenue is also small. All the reductions end up being polynomial functions 
of k and e. 


4. (Lemma 6.6 and 6.7) There is one more issue. Because of the shift in quantile space — 
we compare the virtual value in the Myerson auction at quantile g^(l + Ai) to the virtual 
value in the empirical auction at quantile qi — and also because the reserve prices in 
the two auctions may differ, we also have to analyze the revenue loss at the lower end of 
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the distributions, or more precisely, around the reserve prices. This too is a polynomial 
function of k and e. 

We now discuss the lower bound proof. This involves arguing that, if the number of samples 
is too small, then for every auction strategy, there exists a distribution for which the auction 
strategy’s expected revenue is not near-optimal. We prove this by exhibiting a “distribution 
of distributions” and proving that every auction strategy has expected revenue — where the 
expectation is now with respect to both the initial random choice of the valuation distributions, 
and then with respect to both the m samples and the input — bounded away from the expected 
revenue of an optimal auction (where the expectation is over both the choice of distributions 
and the input). We are unaware of any other lower bounds in auction theory that have this 
form. 

Our construction involves taking a base set of “worst-case” a-strongly regular distributions 
and truncating them at random points. A key observation is that, when such a distribution is 
truncated at a point the corresponding virtual valuation function is linear with coefficient 
a except at the truncation point, where the virtual valuation jumps to Hi. The high-level 
intuition is that, when confronted with valuations that are higher than those seen in any of the 
samples, no auction can know whether a high valuation v corresponds to a truncation point 
(with virtual value v) or not (with virtual value only a{v — 1)). Properly implemented, this idea 
can be used to prove that every auction strategy errs with constant probability on precisely the 
set of inputs that contribute the lion’s share of the optimal revenue. The lower bound follows. 

1.5 Prior and Concurrent Related Work 

We provide detailed comparisons only with the papers most closely related to the present work. 
For previously studied models about revenue-maximization with an unknown distribution, which 
differ in various respects from our model, see H EH- For other uses of samples in auction 
design that differ from ours, see Fu et al. EH, who use samples to extend the Cremer-McLean 
theorem [15] to partially known valuation distributions, and Chawla et al. m, who design 
non-truthful auctions that both have equilibrium revenue within a constant of optimal and 
enable accurate inference about the valuation distribution from samples. For other ways to 
parameterize partial knowledge about valuations, see e.g. Azar et al. [3| and Chiesa et al. [12]. 
For asymptotic optimality results in various symmetric settings, which identify conditions under 
which the expected revenue of some auction of interest (e.g., second-price) approaches the 
optimal with an increasing number of (i.i.d.) bidders, see Neeman [3l], Segal m, Baliga and 
Vohra [6], and Goldberg et al. m For applications of learning theory concepts to prior-free 
auction design in unlimited-supply settings, see Balcan et al. [5]. Finally, the technical issue 
of ironing from samples comes up also in Ha and Hartline [23], in the context of prior-free 
mechanism design, and the aforementioned Chawla et al. m- Our goal of obtaining a (1 — e)- 
approximation of the maximum revenue achieved by any auction is impossible in the more 
demanding settings of [231 [TO]. 

Elkind [20] studies a learning problem closely related to ours, in the restricted setting of 
discrete distributions with known finite supports but with unknown probabilities. In the model 
in m, learning is done using an oracle that compares the expected revenue of pairs of auctions, 
and 0{in?K‘^) oracles calls suffice to determine the optimal auction (where n is the number of 
bidders and K is the support size of the distributions). Elkind [20] notes that such oracle calls 
can be implemented approximately by sampling (with high probability), but no specific sample 
complexity bounds are stated. 

Dhangwatnotai et al. EH], motivated by “prior-independent” auctions that are simultane¬ 
ously approximately optimal for a wide range of valuation distributions, implicitly studied the 
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single-bidder version of the learning problem we that study. With one bidder, the goal is to 
learn approximately the monopoly price of an unknown distribution from samples. Their results 
imply sample complexity upper bounds for this problem of and 0{e~^) for MHR and 

regular distributions, respectively]^ As our results show, the learning problem is quite different 
and more delicate with multiple non-i.i.d. bidders. 

The papers of Cesa-Bianchi et al. [8] and Medina and Mohri 1301 give algorithms for learning 
the optimal reserve-price-based single-item auction. Our problem of learning the best single¬ 
item auction — whether reserve-based or otherwise — is harder. With non-i.i.d. bidders, there 
need not be a reserve price for which the second-price auction has expected revenue more 
than 50% times the optimal [26] . 

Concurrently with our work, Dughmi et al. m proved negative results (exponential sample 
complexity) for learning near-optimal mechanisms in multi-parameter settings that are much 
more complex than the single-item auctions studied here. The paper also contains positive 
results for restricted classes of mechanisms. 

1.6 Subsequent Related Work 

The preliminary version of this paper m motivated several follow-up works. Huang et al. [28] 
study the single-bidder version of our problem, studied implicitly in m, and give optimal 
sample complexity bounds under several different distributional assumptions. Both the upper 
and lower bounds in [28] improve, in terms of the dependence on over those implied by the 
present work. This is also the only paper other than the present work that proves any sample 
complexity lower bounds. 

Morgenstern and Roughgarden [32] adapt tools from statistical learning theory [1] to give 
general sample complexity upper bounds that cover all single-parameter settings for bounded 
or MHR valuation distributions. The results in [32] apply to many more environments than 
single-item auctions, and improve over the sample complexity upper bound of the present work 
(even for single-item auctions), but unlike the present work, their results do not apply to regular 
distributions and do not result in computationally efficient learning algorithms. 

Very recently, Devanur et al. m devised a different learning algorithm for our problem 
which provides a strict improvement on our upper bound. The sample complexity bound in m 
is roughly the same as in [32], but the learning algorithm is computationally efficient and also 
accommodates regular distributions. 

Finally, Cole and Shravas investigated further applications of a-strongly regular distribu¬ 
tions [H]. 

2 Preliminaries 

This section reviews Myerson’s optimal single-item auction |22] for the case of known distri- 
butions. There are k bidders, and for each bidder i there is a distribution Fi from which its 
valuation is drawnQ 

For each buyer i, the auctioneer computes a virtual valuation = v — [1 — Fi{v)]/fi{v)^ 

where fi is the density function corresponding to Fi. (pi{v) is required to be a non-decreasing 

®Similarly, our sample complexity upper bound naturally leads to a prior-independent single-item auction. 
This auction achieves a (1 — e)-approximation of the optimal auction when bidders’ valuations are drawn from 
different regular distributions Fi,..., Fk and there are sufficiently many bidders of each type. 

^Our results extend to the case of k groups of an arbitrary number of bidders, where all the bidders from 
group i have i.i.d. valuations drawn from Fi. Then the sample bounds are a function of the number n of bidders, 
rather than the number k of groups, but the bound is on the number of samples needed from each group of 
bidders. 
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function of v. This holds by definition for regular distributions; in general, if this does not 
hold ifi can be modified (or ironed) so that it does hold, as implicitly explained in the next 
paragraph. Next, the auctioneer runs an analog of a second-price auction on the virtual values 
of the bids (virtual bids for short): the bidder, if any, with the highest non-negative virtual bid 
wins the auction (ties are broken arbitrarily) and is charged the minimum bid needed to win 
(or at least to tie for winning). More precisely, let i be the winning bidder and let 62 be the 
second highest virtual bid. Then the price is (/p^“^(min{0, 62 })- We note that (pi~^{0) can be 
viewed as a bidder-specific reserve price for i; it is also called the monopoly price for i. 

We can also describe the auction in terms of a revenue function. This also allows for 
situations where (fi{v) is not a nondecreasing function of v. The revenue function is computed 
in quantile space: qi{v) = 1 — Fi{v) is the probability that i will have a valuation of at least v. 
Now we view 'L’ as a function of qi. We introduce the expected revenue function, Ri{qi). It is a 
function of the quantile qp. Ri{qi) — v{qi) • qi is the expected revenue if i is the sole bidder and 
v{qi) is the price being charged. The auctioneer computes the smallest concave upper bound 
CRi{q) of Ri{q). Now ^i{v{qi)) is defined to be the slope of CRi(gi) (this yields an increasing 
function which coincides with the previous definition in the case of regular distributions). At 
points where there is no unique slope we choose (pi{v{q)) = lim(^/>^)^^ (^^(?;(g')). The auction 
then proceeds as before. Henceforth, overloading notation, we write (pi{qi) rather than p>i{v{qi)). 

Myerson [33] proved that for every auction, the expected virtual welfare equals the expected 
revenue. This result is important because, in many situations, it is much easier to reason about 
expected virtual welfare than directly about expected revenue. 

Theorem 2.1 (Myerson). The expected revenue of any single-item auction is given by 

k 

i=l 

where Xi{qi) is the probability (over others’ valuations and any coin flips by the auction) that i 
wins the item with a bid at quantile qi in Fi. 

Let q = (^ 1 , ^ 2 , • • • 5 ^fc) be a vector of quantiles drawn from Fi x F 2 x ... x We can 
rewrite the expected revenue as 


k « 

'^Eg^[ipi{qi) ■ Xi{qi)],= '^ / dq, 

i=l i dq 


( 2 ) 


where Ii(q) is the indicator function showing whether i wins when the bids are at quantiles q 
(or more generally, the probability that it wins). This immediately implies that allocating to a 
bidder with the highest virtual value, i.e. Myerson’s auction, is optimal. 


3 statement of Main Results 

We formally state our upper and lower bound results in turn. 

Theorem 3.1. In a single-item auction with k bidders with independent regular valuation dis¬ 
tributions, i/m = ri(^ In^ |), then there is an m-sample auction strategy with expected revenue 
at least 1 — e times that of an optimal auction. 

The auction strategy in Theorem |3.1| is a variant on the “empirical Myerson auction,” 
described in detail in Section 16.11 

Our lower bound result has an analogous form, although the polynomial in k and e is 
considerably smaller. Our lower bound grows larger as ct > 0 grows smaller. 


Theorem 3.2. For every auetion strategy 'S, for every k >2, for every sujfieiently small 6 > 0^ 
for every a > 0 and m satisfying: 


i. a = 1 and m < 


/ \ 1/2 

/ _ 1—In 2 _ j . 

I 96e^ min{l,^} lnmax{e,A;} J 


a. 0 < a < 1 , and m < ^ ^ 

in. 0 < a < 1, ^ < i, and m < 

. 0 < a < 1, and m < 


III 


£l/(l+a) ) 
i_c \ 1/(1+Q;) 


(fcai/(i-«)) 


a/(l+a) 


ei/(i+a) ; 


V. a = {) and m < g^f; 

there exists a set Fi^... of a-strongly regular valuation distributions sueh that the expeeted 
revenue of S (over the m samples and the input) is less than 1 — e times that of an optimal 
auetion for Fi,..., . 


At the two extreme poin ts o f MHR distributions {a — 1) and regular distributions (cr = 0), 
the lower bound in Theorem 3.2 is )? respectively. In all cases, the dependence 

on the number of bidders is linear or near-linear. 

We prove these two theorems in Sections and respectively. Before that, in the next 
section, we briefly indicate two applications of cr-strong regularity. 


4 Applications of Strong Regularity 


We believe our definition of ct-strongly regular distributions is of independent interest. Almost 
all previous expected revenue approximation guarantees for auctions in Bayesian settings apply 
to one of three sets of valuation distributions: all distributions, all regular distributions, or all 
MHR distributions (see e.g. 123). Strongly regular distributions interpolate between regular and 
MHR distributions, and should broaden the reach of many existing approximation bounds that 
are stated only for MHR distributions. To prove this point, we mention a couple of examples of 
such extensions; we are confident that many others are possible, as has been shown subsequently 
by Cole and Shravas m- 

The following property of MHR distributions is well known [271 Lemma 4.1]. 

Lemma 4.1 ([27]). Let F he an MHR distribution with monopoly priee r. If q{r) is the quantile 
of valuation r in the distribution F, then q{r) > 

We next show how to generalize this result to cf-strongly regular distributions. 


Lemma 4.2. Let F he an a-strongly regular distribution with a G (0,1) and monopoly priee r. 
If q{r) is the quantile of valuation r in the distribution F, then q{r) > 

Proof. Set A = 1 — cf. Let /i(-) denote the hazard rate of F, and choose c so that h{r) = 
Recall that = v — j^. Since (p{r) = 0 and hence h{r) = 1/r, we have c = r(l — A). 

The cf-strong-regularity condition, ^ > cf, implies that 1 + > cf, or ^ (^) < A. It 

follows that, for all v < r^ h{v) < and hence 


h{v) < 


1 

X{v — r) + r 
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Now write H{x) = fg h(v)dv, it is well known and easy to verify that g(v) = e We 

complete the proof by deriving 

q(r) = =e-/o 

> 15 ] 

= = (1 - A)^/^ = 


□ 


Hartline et al. 1271 Theorem 4.2] study a revenue maximization problem in social networks, 
and give a mechanism with approximation guarantee 


4-i 


.306 


(3) 


when players’ private valuations are drawn from MHR distributions]^ The MHR assumption is 
used only in applying Lemma [4T| Relaxing the distributional assumption to a-strong regularity 
and reoptimizing the proof in m Theorem 4.2] using Lemma |4.2| extends this approximation 
guarantee accordingly, with the term 1/e in Q replaced by 

For a second example. Hartline and Roughgarden [26l Theorem 3.2] consider downward- 
closed single-parameter environment^ and prove that, when bidders’ valuations are drawn from 
MHR distributions, the VCG mechanism with “eager” monopoly reserve prices has expected 


revenue at least ^ times that of an optimal mechanism. For an a-strongly regular distribution F 
with monopoly price r, and v > we have (p{v) > a{v — r) and hence 


r H- ip{v) > V] 

a 

this inequality generalizes Lemma 3.1 in [26] . Following the proof in |26^ Theorem 3.2] shows 
that, for every downward-closed single-parameter environment with bidders valuations drawn 
from (a-strongly regular distributions, the VCG mechanism with eager monopoly reserves has 
expected revenue at least times that of an optimal mechanism. 


5 The Lower Bound: Proof of Theorem 13.2 

Formal Statement Fix cf > 0 and 0 < 5 < 1, where 6 is sufficiently small. We show that for 
every auction strategy S, there exists a set Fi,... of cf-strongly regular distributions such 
that the expected revenue of the auction strategy (over the samples and the input) is at most 

®They also give a |-approximation algorithm for arbitrary valuation distributions. 

® A (binary) single-parameter environment is specified by a set of bidders and the feasible subsets of bidders 
that can simultaneously win. For example, if each bidder i wants a known bundle Si of items, then the feasible 
subsets are those in which the bundles of the chosen bidders are pairwise disjoint. Such an environment is 
downward closed if every subset of a feasible set is again feasible. 

^°In more detail, one reserve price per bidder i is fixed in advance. A bid is collected from each bidder. 
Bidders who bid below their reserve prices are removed from further consideration. From the remaining bidders, 
the mechanism chooses winners to maximize the sum of their bids, subject to feasibility. The mechanism charges 
the unique prices for which losing bidders pay 0 and truthful bidding is a dominant strategy. See [26] for details. 


10 










the following fraction of the expected revenue of the optimal auction for Fi^... 

l-ln2 


l-e(l,5) = l 

1 — 6((a, 5) = 1 
1 — 5 ) = 1 

1 — 6((a, 6) = 1 

1 — e((a, 5) = 1 


96e^ min{l, -} Inmaxje, k} 


1 - a2^-^ 
96e^ 

1 - a2^-^ 
96e^ 

1 - a2^-^ 


96e^ 


^l+a 

2^5 


(A:aV(i-«))c 


1 


96e^ 


for (a = 1 


for (a < 1 and | 

for (a < 1 and ^ | 


for (a < 1 and ^ 


for (a = 0 


We note that if (a < 1, then a2^ ^ <1 also. In addition, for fixed fc, — 1- 

Substituting k/m for 6 yields the bounds in Theorem 3.2 For ^ < a < 1 and sufficiently small 


constant 6 > 0, samples are necessary for a (1 — 6)-approximation. For the MHR 

{a = 1) case, Q{k/V^ Ink) samples are necessary, and for the regular (a = 0) case, Q{k/e) 
samples are needed. 

The Base Distributions We identify the worst-case distributions for a given (a > 0. Specif¬ 
ically, for V G [0, oc), consider 


F^{v) = 1 

= 1 — e 

riv) = 


1 


1 -h (1 — a)v 


1 

l—a 


1 -h (1 — a)v 
f\v) = e-\ 

The corresponding hazard rates are 
h^{v) = ^ 


2-a 

l—a 


1 -h (1 — a)v 


h\v) = 1 

with virtual valuation 


for 0 < (a < 1; 
for (a = 1; 

for 0 < (a < 1; 
for (a = 1. 

for 0 < (a < 1; 
for (a = 1; 


= av — 1 for 0 < (a < 1. 


A quick calculation shows that 


f ^ 

[0 

^r-\ 



1 ln4 

V q 




if 0 < (a < 1 
if (a = 1. 


(4) 


The Construction We define a distribution over distributions. Each bidder i is either type 
A or type B (50/50 and independently). For a type B bidder i, we draw q uniformly from the 
interval [^, |] and set Hi = — q). We then define bidder Fs distribution Fi as equal 

to F^ on [0, Hi) with a point mass with the remaining probability 1 — F^{Hi) at Hi. For a type 
A bidder we proceed similarly except that Hi is set to These distributions are 
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always a-strongly regular. An important point is that the virtual valuation of these bidders is 
given by 


ip{v) 


av — 1 \i V < Hi 
Hi \iv = Hi. 


(5) 


Ultimately, it is the gap in virtual valuation between these two cases that is responsible for the 
lower bound in Theorem 13.21 


A Preliminary Lemma Let denote the monopoly price in an auction with a single type 
A bidder. We define = ^, as this is the largest quantile q for which ip{q) > 0 when cr = 0. 
For (a > 0 we begin by determining the monopoly price for F^. We note that q:^{v) =0 when 
V — From Q, we deduce this occurs at q^ — for 0 < cr < 1, and at ^ for cr = 1. 

Thus, for bidder i, for 0 < cr < 1, q^ = max{^, and for cr = 1, = max{^, ^ 

(as we can assume that /c > 2, 5 < 1). 

Let denote the distribution of a type A bidder. Now let 'u* = (F^’^)“^(max{l — 

the value corresponding to quantile min{g^, in and let F* = min{/cg^, 

k times the revenue at this quantile. From Q, we obtain that 


V 


* 


/ 

1 

1 — 0 


In max 

< 

1^5 = Inmaxje, 


if 0 < a < 1 
if a = 1. 


( 6 ) 


Lemma 5.1. (Upper Bound on Optimal Revenue). The expected revenue (overw) of the 
optimal auction (with respect to the Hi’s) is at most F*. 


Proof. First, the expected revenue of the optimal auction is upper bounded by that of the 
optimal auction for the case where all Hfs are (F'^)“^(l — — i.e., where Fi — F^’^ for every 

i. Second, by symmetry, when bidders valuations are i.i.d. draws from F^’^, every bidder has 
the same purchase probability q in the (symmetric) optimal auction, and since there is only one 
item, this purchase probability q is at most it is also at most q^. Third, we obtain an upper 
bound by dropping the constraint of selling only one item and instead optimally selling to each 
bidder with probability q. Fourth, this is precisely k times the revenue of selling to a single 
bidder with valuation from F"^’^ using the posted price (F"^’^)“^(l — q). Fifth, by regularity, 
selling to a single bidder with posted price (F^’^)“^(l — q) with q < min{g^, is no better 
than selling with posted price = (F^’^)“^(max{l — ^^}). The expected revenue from 

any one bidder is therefore at most the sale probability times t*, namely min{g^, |} • t*. The 
overall revenue, with k bidders, is thus at most k x min{g^, = F*, as claimed. □ 


Overview of Proof The high-level plan is the following. Fix an arbitrary auction strategy. 
Think of the random choices as occurring in three stages: in the first stage, the F^’s are chosen; in 
the second stage, m sample valuation profiles v(^\ ..., v^^) are chosen (i.i.d. from Fi x • • • x F^f)] 
in the final stage, the input v is chosen (independently from Fi x • • • x Fj^). We prove that 
the expected revenue of the auction strategy (with respect to all three stages of randomness) 
is at most 1 — e(a, 6) times that of the optimal auction (with respect to all three stages or, 
equivalently, the first and third stages only)j^ Again, 1 — e((T,5) < 1 will be independent of k. 
This implies that, for every auction strategy, there exists a choice of Fi^... ^F^ such that the 
expected revenue of the auction strategy is at most 1 — e{a^ 6) times the expected revenue of 
the optimal auction for the distributions Fi,..., F^^. 


^We prove this statement about the expected virtual welfare, which is equivalent by Theorem 


2.1 
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By Lemma 5.1, i?* is an upper bound on the optimal auction’s expected revenue (equiv¬ 
alently, expected virtual welfare) for every choice of The main argument is the 

following: there is an event £ such that, for every auction strategy: 

(i) the probability of £ (over all three stages of randomness) is lower bounded by a function 
7 ( 5 ) of 6 (and independent of k and a); 

(ii) given f, the expected virtual welfare of the auction strategy is at least e((a,5)i?* smaller 
than that of the optimal auction, where e{a^ 5) > 0 is a function of a and 6 only. 

Since by Q, for each set of bids, the virtual welfare earned by the optimal auction is always 
at least that of the auction strategy, (i)-(ii) imply that the expected virtual welfare (and hence 
revenue) of the optimal auction exceeds that of the auction strategy by e((a,5)i?* for some 


constant e{a^6) depending on a and 6. Lemma 5.1 then implies that the auction strategy’s 
expected revenue is at most 1 — e((a, 5) times optimal. 


The Main Argument To define the event £^ we use the principle of deferred decisions. We 
can flip the second- and third-stage coins before those of the first stage by sampling quantiles 
— {m + l)k i.i.d. draws from the uniform distribution on [0,1]. (Once the distributions 

are chosen in the first stage, the valuation is just F^^{1 — We further break the 

first-stage coin flips into two substages; in the first, we determine bidder types (A and B); in 
the second, we choose iL^’s for the type-B bidders. The event £ is defined as the set of coin flips 
(across all stages) that meet the following criteria: 

(PI) There are exactly two quantiles of the form that are at most say of bidders j 

and 1] 

(P2) and are greater than 

(P3) for i = 1, 2,..., m, q^^^ and q^^^ are greater than 

(P4) one of the bidders is type A, the other is type B (we leave random which is which); 

(P5) the type B bidder (from among j, 1) has valuation equal to the maximum valuation from 
its distribution. 

The next lemma corresponds to step (i) in the proof approach above. 

Lemma 5.2. The probability of £ (over all three stages of randomness) is lower bounded by 

32e3 • 

Proof. We first sample the k quantiles corresponding to the third stage. Elementary computa¬ 
tions show that property (PI) holds with probability at least ^5^ (independent of a and k). 
Conditioned on (PI) holding, (P2) holds with probability (P3) is independent of the first 
two properties, as it depends only on the second-stage randomness, and it holds with constant 
probability of at least ^ (independent of a^k). Proceeding to the first stage, (P4) is indepen¬ 
dent of the first three properties and holds with 50% probability. Conditioned on (PI), (P2), 
and (P4) (as (P3) is irrelevant), the probability of (P5) equals the probability that a uniform 
draw from [^, |] (used to determine the iL-value) is at least the g-value of the type B bidder, 
which is conditionally distributed uniformly on (^, fj- This happens with probability We 
conclude that all of (P1)-(P5) hold with a positive probability, namely 

*>■£, 

□ 
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To work toward statement (ii), we next prove that, for every auction strategy, conditioned 
on the strategy fails to allocate the item to the optimal bidder — the type-B bidder with 
its maximum-possible valuation — with constant probability. It suffices to analyze the auction 
strategy that, conditioned on maximizes the probability (over the remaining randomness) of 
allocating to the optimal bidder — of guessing, from among the two bidders that in 
have valuation at least —|), which one is type A and which one is type B. Since the two 

bidders were symmetric ex ante, Bayes’ rule implies that the probability of guessing correctly 
(given £) is maximized by, for every ..., choosing the scenario that maximizes the 

likelihood of the valuation profiles ..., (given £). 


Lemma 5.3. Every auction strategy, conditioned on £, allocates to a non-optimal bidder with 
probability at least 


Proof. The only valuations that affect the relative likelihoods of the two scenarios are 

and We already know the optimal bidder is either j or i. Property (P3) of event £ 

implies that the m sample valuations from j and £ are equally likely to be generated under 
the two scenarios — the distributions of type-A and type-B bidders differ only for quantiles in 

[Od]- 

Now, given and the posterior probabilities of the two scenarios are not equal. 

The reason is that, conditioned on £, the type-A bidder’s valuation is distributed according 
where q is uniform in f], while the type-B bidder’s valuation is distributed 
according to the smaller of two i.i.d. such samplesThus, assigning the item to the bidder of 
j,£ with the lower valuation (in maximizes the probability of allocating to the optimal 

(type-B) bidder. The probability that this allocation rule erroneously allocates the item to the 
type-A bidder is the probability that a sample for a distribution (the type-A bidder) is smaller 
than the minimum of two other samples from the same distribution (the type-B bidder), which 
is precisely □ 


The following lemma completes the proof of Theorem 3.2 


Lemma 5.4. The revenue of every auction strategy is at most the following fraction of an 
optimal auction's revenue: 


1 - 
1 - 
1 - 

1 - 
1 - 


l-ln2 1 • n 

96e^ lnmax{e,A;} e^ 


{l-a2 

{l-a2 


1 — 


96e^ 


1 —q:\ 


jl+a 

xl+» 


96e3 (fcQ!V(i-«))« 


(1 -a2^-“) 




96e^ 


if a = 1 

ifO<a<l and ^ < Qa = 

fv 

S 1 

ifO<a<l and — < Qa — < — 

■' 2k ^ k 

ifO<a<l and qa — 

-2k 

if a = 0. 


more detail, consider a type-B bidder i and condition on the event that its quantile qi = 1 — Fi{vi) is 
in [^, f] and that its valuation is its maximum possible, which is equivalent to the condition that its fictitious 
quantile q'^ that generates its threshold Hi lies in |]. The joint distribution of (^i, q'i) is the same as the process 
that generates two i.i.d. draws from [^, |] and assigns qi and fi to the smaller and larger one, respectively. Note 
that the valuation of the bidder is, by definition, — ql). 
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Proof. Condition on the event E. By the virtual value of tho type B bidder i equals 
Hi > )5 substituting g = | in Q yields a lower bound on Hi which implies that 

ii a < 1 
ii a — 1. 

For the type A bidder, by the virtual value ip a is at most a • {F^) — 1; using @ 

again, this implies 



f -L_ 

\(>- 


> 

I l-a 

1< 



1 1^1 




J a 

1 

1 —a 

1 - 

1 

d 

1 

1 —1 

[ In 

2k 

s 

i 


< 

Thus, still conditioned on T, 

> 


V 1—a 


if (a = 1. 


if«<l 

1 — In 2 if (a = 1. 


We now bound the fractional loss of revenue. By Lemma 5.2, £ occurs with probability at 


least 5^/(32e^). By Lemma 5.3, conditioned on £^ a type A rather than a type B bidder is 
wrongly allocated the item with probability ^. Thus the expected loss of revenue is at least 


1 52 


3 32e3 


{(PB - (Pa)- 


Recall from Lemma|5.l|that the optimal revenue is bounded above hy R* = k x min{g^, \} -v*, 


where 'u* = ^(max{l — q^, is the value corresponding to quantile min{g^, in 

Recalling (IgI), we can lower bound the fractional loss of revenue at follows. 


If (a G [0,1) and > |, then the fractional loss of revenue is at least 


52 






(I) 


1—a 


1 — , 

^ 1 crz 


3-3263 R* 3-3263'" - 1) ^ 96e3 

If a G [0,1) and < ^, then the fractional loss of revenue is at least 


1 


3 • 32e3 


(1 - q ;2 


1 — 


1 

1 —a 


52 


_(i) 

-a 

-1 


> 


(l-a 2 i-“) ( 1 )^ “52 
96e^k{q^)^ 

(1 - 

96e^{q^k)^ 


If q^ = this becomes 


and ii q^ — ^ this simplifies to 


(1 - 

2^{l-a2^-^)5 
96e^ 

Finally, if a = 1, as the fractional loss of revenue is at least 


ict — 2k 


1 -ln2 


96e^ min{ 1, |} In max{e. A:} 


-5A 


□ 
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6 The Upper Bound 


Section |6.1| describes in detail the empirical Myerson auction, the auction strategy for which 
the guarantee in Theorem |3.1| holds. 


6.1 The Empirical Myerson Auction 

In the empirical Myerson auction^ we assume we are given m independent samples from each 
distribution Fi. The gist is to treat the resulting empirical distribution as the actual distribution 
in a Myerson auction (Section]^, though some additional technical details are required. In our 
variant, a number of the samples with the highest values are discarded, and there is a further 
detail regarding how to handle any high bids that occur in the auction (i.e. bids larger than the 
largest non-discarded sample). 

In detail, for each bidder i, we use the samples from Fi to construct an “empirical revenue 
curve” as follows (see also Figure |^: 

1 . Suppose that the m independent samples drawn from Fi have values vn > Vi 2 > ... > Vim> 
Define the “empirical quantile” of Vij as 

2 . Discard the [|mj — 1 largest samples, for a suitable | > o|^ Let S denote the remaining 
samples. 

3. For each remaining sample Vij G 5, plot a point 

4. Add points at ( 0 , 0 ) and ( 1 , 0 ). 

5. While only needed for the analysis, it will be helpful to define the “empirical revenue 
curve”, Ri{q): this is the curve comprising straight-line segments joining the sequence of 
points specified in Steps 3 and 4 above. 

6 . Take the convex hull — the least concave upper bound — of this point set. Denote the 
resulting “ironed empirical revenue curve” by CR^. This curve has constant slope between 
any two consecutive empirical quantiles of points of S. 

Now define empirical ironed virtual values as follows. For v > v. it is simply the value v. 
For V < identify the two samples Vij^Vi(^j_^i^ G S that “sandwich” v. The empirical ironed 

virtual value of v is defined as the slope of the revenue curve CR^ in the interval defined by the 
empirical quantiles of Vij and Note that the empirical ironed virtual value of Vi^^ is 

also Vi^^^ and that the empirical ironed virtual valuation is a nondecreasing function of v. 

Finally we run Myerson’s auction on these empirical ironed virtual valuations. That is, the 
item is awarded to the bidder, if any, with the highest non-negative virtual value (with ties 
broken arbitrarily). The winner’s payment is the lowest bid needed to ensure a (tied) win. 


6.2 Notation 

We next specify notation so as to clearly distinguish parameters for Myerson’s optimal auction 
from those for the empirical auction, as our analysis will be repeatedly comparing these two 
auctions. After a couple of simple results. Lemma |6.2| bounds the empirical quantiles as a 

^^The reason for discarding the largest samples is that if they were present there is a non-negligible probability 
that they would create a poor approximation at the high value end of the distribution, which is the end that 
matters the most. See also m- 

V is one of the points of S and there are multiple choices for this slope, we take the largest one. 
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Figure 2: Construction of the ironed empirical revenue curve. 


function of the actual quantiles, and vice versa (this is essentially Lemma 4.1 in [HI)- Next, 


Lemma [6^ relates the empirical and actual virtual values. With these in hand, in Section [63| 
we bound the expected revenue loss due to using the empirical auction as opposed to Myerson’s 
optimal auction, assuming for the latter auction that the actual distributions were fully known. 


Myerson’s Auction Let MR (the “Myerson Revenue”) denote the expected revenue earned 
by Myerson’s auction. Let Xi{q) denote the probability that bidder i wins in Myerson’s auction 
with a bid that has quantile q in its value distribution. Recall that Vi{q) denotes the value 
corresponding to quantile q and (pi{q) denotes the virtual value at quantile q. Let MR^ = 
Eq. [cpi • Xi] denote the expected revenue provided by i in Myerson’s auction (recall Theorem 2.1). 
Let qi{v) denote the minimum quantile for value v] sometimes it will be convenient to let ql 
denote qi{v). Let be the reserve price applied to i in Myerson’s auction, namely the largest 
value for which (pi{qi{v)) = 0. Let qr- denote qi{ri). Let SR^ = E[(pi{q) \ q > Vi] — qr- • rp note 
that SRi is the expected revenue if i were the only participant in Myerson’s auction (SR^ is 
short for “Single buyer Revenue”). Sometimes, to reduce clutter, we suppress the index i when 
it is clear from the context. The following claim is immediate from the definitions. 


Claim 6.1. i. MR = MR^ 

a. SRi < MR for all i. 


The Empirical Auction The empirical auction is defined in terms of the “empirical quantile” 
g, but its analysis will entail considering its revenue as a function of the actual quantile q. We 
specify notation which will distinguish between these two parameters. For v < we define 

the empirical quantile q{v) as the solution to v • q = Ri{q) in q. (If there are multiple solutions, 
we take the smallest one.) Going the other way, for an empirical quantile ^ > <^, we define v{q) 
as the solution to v ’ q — Ri{q) in 'L’. The empirical ironed virtual value Vi{q) of an empirical 
quantile g ^ is the slope of CR^ at q. Let Xi{q) denote the probability that bidder i wins 
in the empirical auction with the bid Vi{q). Let fi denote the empirical reserve price, which is 
the minimum of Vi{^) and the largest value v for which (fi{qi{v)) = 0, and let denote the 
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corresponding empirical quantile. Again, to reduce clutter, we sometimes suppress the index i 
when it is clear from the context. 

The actual quantile q corresponding to empirical quantile q is defined by the relation Vi{q) — 
Vi{q)] it is denoted by q{vi{q))] we write it as q for short. Finally, we write the empirical 
probability of winning as Xi{q) = Xi{q). 

As the auction may draw values Vi > Vi{^) it is convenient for the purposes of our analysis to 
define q for q < ^. Let be defined by For qi < we define the corresponding 

value of g, as q{q) — ^q. Then, for q < 


6.3 Relating the Actual and Empirical Quantiles 

The following result is essentially Lemma 4.1 in m- 


Lemma 6.2. Let F he a regular distribution. Suppose m independent samples with values 
xi ^ X 2 > ... > Vm ciTe drawn from F. Let 7 > 0 , <^=^<1 for some integer a > 0 be given, 
and let v be defined 6 ^ 1 + z/ = (1 + 7 )^. Let t^ — Then, for all v < 


q{v) € 


Qjv) 

.(1+7)2’ 




q{v) 

.(1 + 0 ’ 


q{v){l + u) 


or equivalently 


with probability at least 1 


q{v) € 


g(0 

.(1 + 0 


q (?;)(1 + i /) 


5, if 'y^m > 1 and m > max{^, In 


Proof We begin by identifying a subsequence of the samples, with h < h ^ 

...< Is] we rename the sequence izi, 1 ^ 2 ,..., for notational ease. It will be the case that 
tii^i ^ (1 + for 1 < z < 5, and + 7 ) > 1. We will show that 


q{vh) e 


th 

(1 + 7)’ 


i/i(i + 7) ) 


for Vh EU = {«!, . . . ,Ms}. 


The claimed bound is then immediate as either each v < is sandwiched between two items 
in t/, or it is at most Ug. 

We define the k as follows: li = and = [(1 + 7 )/^] if [(1 + 7 )/^] < m, and otherwise 
li^i is not defined (i.e. i = s). As > 1 , [(1 + > h + 1, from which 

we conclude that the sequence is strictly increasing and hence well defined. 

Next we bound the probability that q{ui) > (1 + 7 )^/.. Now q{ui) > (1 + 7 )^/. only if fewer 
than li = ti.mF ^ samples have q values that are at most (1 + 7 )^/.. As the expected number of 
such samples is (l + 7 )t/.m, a Chernoff bound gives the following upper bound on the probability 
that q{ui) > (1 + 7 )tz. (cf. [3l]): 


exp{- 


2(1 + 7) 


}• 


Similarly, the probability that q{ui) < tij(l + 7 ) is bounded by 


exp{- 


3(1 + 7)^ 


It will be helpful to bound both tym and — f/Jm. As > 1 , tym = (2^m — l)/2 > 
And as yfru > 1 , - tif\m > [(1 + 7 )/,] -li> [yk] > [y^m] > \yim. 
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Now, by the union bound applied to all the q{ui)^ we obtain a failure probability of at most: 


, .a ) + , .a } 


i=l 


3(1 + 7)- 


2(1 + 7)- 


< 


2 y^exp{- 




} 


1=1 

1-1 


< 2 ^ exp - 


3(1 + 7) 

7^[(^m + (i — l)7^m] 


i=H 


6(1 + 7 ) 


using the bounds on ti^m and —ti.]m 


< 


2exp{- 


“''+’3 <3expi-''«” 


l-exp{-^} 

7^^m 


6 ( 1 + 7 ) 


if exp < — 


6(1 + 7 ) 


1 

< 

- 3 


We want the failure 


probability to be at most 5. So we need f 5 — 


We also need vn > In 3 to satisfy the condition in the final inequality. 


6 ( l ±+ in 3 _ 

□ 




6.4 Relating the Actual and the Empirical Virtual Values 


Let Ea be the event that the high probability outcome of Lemma |6.2| occurs, namely that for 


all V < , q{v) 


( 1 +^) ; g('^)(l + ^) ■ occurs with probability at least 1 — 5. It will also 


be helpful to express the bound on u as a bound on q. To this end, we define ^ = ti = ^^7- 
We will repeatedly encounter terms of the form (p{\q) with A > 1; For Xq > 1, ip{Xq) is 
interpreted to mean (y?(l); similarly for (p. 


Lemma 6.3. Conditioned on Sa, for all empirical quantiles q > ^, CR{q) < q ■ v{j^), and for 

>th-v{ 4(1 + u)). 

This lemma is not as obvious as it may seem for it concerns points on the convex hull CR 
of the set of points R that are used to specify the empirical revenue. 


Proof. By Lemma 6.2, as Sa holds, for all > C? 


th 


th • + ly)) < R{th) < th • '^(^yp^)- 


We define L{q) = q • v{q{l + z/)) and U{q) = q • '^(p^) for all q. 

Note that for any pair q ^ q' oi quantiles, the line joining the actual revenue i?(p^) = 

Y^'z;(y^) to i?(Y^) = Y^'r’(Y^) is parallel to the line joining U{q) to U{q')^ for the latter 
line is obtained by expanding the former line by a factor 1 + z/ in both the quantile and revenue 
dimensions. By the regularity of ( 7 , the curve defined by R is convex, and consequently, the 
points U{q) all lie on their convex hull. 

For th > C 5 U{th) is an upper bound on R{th)] it follows that the convex hull for the 
empirical revenue, for g > ^, is enclosed by the convex hull U{q)^ and consequently CR(g) < 
u{Q) = rv{j^j. 
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For the second result, the lower bound, we use a similar argument, but it will apply just 
to the empirical quantiles > C- Now, for any pair q ^ q' oi quantiles, the line joining the 
actual revenue R{q{l + v)) = q{l + v)v{q{l + v)) to R{q'{l + z^)) = q'{l + y) • v{q'{l + v)) is 
parallel to the line joining L{q) to F(g'), and hence the points L{q) all lie on their convex hull. 
But, for th > C, L{th) < R{th)^ and consequently the values R{th) all lie on or above the curve 
1(4). □ 

The following lemma, which lies at the heart of out analysis, shows that with high probability 
is close to some value (f{q') with q' G [ (i^axi+i/)3 ’ ^(^ + + ^)]* 

Lemma 6.4. Let F he a regular distribution. Suppose that (1 + A) > (1 + z/)^. Let 4 = 
for 1 < h < m. Conditioned on £a, i/4-i < Q then 

i. for all q with < q, (p{q) < (i+a)'(i+^)3 ) + + + 

and 


a. for all q with ^ < q, (f{q{l + A)(l + u)) < <^(4(1 + 0) + 2^^ < ‘f{q) + 2s^- 

Proof. The main part of the proof concerns the second inequality in (i) and the first one in (ii). 
We begin by proving the inequality in (i). First we give an upper bound on and a 

lower bound on y( (i+AKi+^)3 )• 

As F is regular, R is convex; thus: 

/ 4 X ^ -^( (l+t)^ ) “ (l + A)(l+J/)4 ) 

s _ 

^ (i+i^r (i+A)(i+i/)4 

^ ( t 4 ^ -^(( 1 +^^ ~ {l+A){l+u)^^({l+A)(l+u)^)"> 

_ _ th _ 

(l+i/)2 (l+A)(l+i/)4 

(1 + A)(l + - ^((i+aki+^)^) 

2z/ + z/2 + A(1 + z/)2 


The following bound applies only when ^ otherwise CR(^Y 4 :^y(i:pP) 3 ) 

defined; the constraint q > (^(1 + A)(l + z/)^ suffices. 




(l + A)(l + z/)3^ 


> 


> 


th _ th _ 

(1+J.)3 (1+A)(1+J.)3 

( t 4^ • ~ {l+A){l+uy-i'^({l+A){l+u)'^) 

th _ th _ 

{l+u)3 (l+A)(l+j/)3 

^)'^( (l+^i/)2 ) “ ^((l+A)(l+i/)4) 

A ■ 


(by Lemma 6.3) 


Now, we combine the bounds so as to eliminate the term v{ )• 
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2v + v^ + A(1 + v) 




th 




< 

< 


A + ^^(l + A)(l + J/)3' 

(1 + A)[(l + uf - (1 + AM^^) 

A 

(1 + A)( 2 z/ + z/^) SR(1 + u) 


A 


th 


(as SR > 


th 


(l + Z/)2 V(l + ^)" 


th 


< 2^{l + A){l + uf^. 


In other words, 

Z/'(l + A) (2 + v) 


1 + 


Thus 


A 


‘Pi: 


pi: 


th 


th 


ii + i^y 

2 / < ‘Pi - 


)-pi 


Q X ^ ^ / AX / SR 

{i + A)(i + o=>) - + + y 


.. . w. ySR 


(l + A)(l + i/)3) + 2x(l + A)(l + i^) _ 


■{i + uy 

The second inequality in (ii) is shown similarly. We start with an upper bound on + 

A)(l + z/)) and a lower bound on (p{th{l + z/)). The first bound applies only when > ^5 here 
q > ^ suffices. 


(/^(g(l + A)(l + z/)) < (^(tzi(l + A)) < 


< 


CR(4(1 + A))-CR(4) 

(1 + A)tzi — th 

4(1 + A) • - 4^^(4(1 + 0) 

^th 

il + A)vCP^)-v{tnil + v)) 

A 


(by Lemma 6.3) 


</3(4(1+0) > 


Ri 


ih(l+A)N 
( 1 +^) > 
ih(l+A) 


- -R(4(i + 0) 

~ i/i(l + 


^ v)v{thil + 0 ) 

4(1 + 0 

(1 + AM^i±^) - (1 + 0M4(1 + 0) 

A - 2i/ - 1/2 


Again, we combine the bounds so as to eliminate the term n(4(l + ^Y}- 


ipiq{l + A)(l + 0) - .Z. +(4(1 + 0) 


< 


< 


A{l + pf 

[1 + A)[{A-2v-vY-^^^ 


vi 


^fc(l + A) N 
l+v > 


A(A - 21 / - 1/2 
(l + A)t /(2 + 0 SR{l + v) 
A (1 + z/)^ (1 + A)tzi 


(as SR > 

^ “1 + 1 / 


< 2 ii^. 

- A g 


^ (1 + A)4 ^ 
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In other words, 


V?(g(l +A)(l + i/)) 


1/(1 + A)(2 + I/) \ 

A(l + 02 J 


/ / XX ^ ^ SR 

(p(th(l + iy)) < 2^-^. 


Thus 


77 SR 

(p(q(l + A)(l + z/)) < (p(th(l + z^)) + 


We no w sho w the remaining inequalities. To obtain the first inequality in (i), we note that 
and Sa^ q > (T+iq ^ which the result follows. Similarly, for the 


by Lemma 


6.2 


second inequality in (ii), + z/) > q{l + u) > g, and again the result follows. 


□ 


6.5 Bounding the Expected Revenue Loss 

Finally, we consider an auction with k bidders, where the valuation for the ith bidder comes 
from regular distribution Fi. For brevity, bidder i is referred to as i. 

We define Ei) to be the event that £a holds for every distribution Fi. 

Let Shtf = ‘ ^i] ~ ‘ ^i]‘ other words, MR + Shtf = MR, so it suffices to 

show that Shtf < eMR. Recall that qr- denotes the quantile of Fi corresponding to the reserve 
price for i in the Myerson auction and qr. denotes the quantile corresponding to the reserve 
price in the empirical auction. Also, we let qi be a quantile for i in the empirical auction, and 
we let qi denote the corresponding quantile in Fi. qj and qj are defined similarly with respect 
to j. In addition, to reduce clutter, we let /3 = (1 + A)(l + — 1. 

The next lemma provides an upper bound on Shtf as the sum of several terms which we will 
bound in turn. 


Lemma 6.5. Conditioned on £}), 


Shtf = 


I L 

S E 


/ ^i{qi)-Xi{qi)dqi- <Pi{qi) ■ Xi{qi) 

Jqi<qr. dqi<qr. 


dqi 


qi 


I (fiiqi) ■ [Xi{qi) - Xi{—^)] dqi 

qi<qri ^ + P 

+ PY q:>i{qi) dqi 

■ Jqi<qr^ 


+ 


S/ 

7 Qr. 


[-ipi{qi)] dqi. 


( 7 ) 

( 8 ) 
( 9 ) 


lqr^<qi<qr^ 

Proof. We upper bound the second (negative) term in the expression for Shtf. 

- / ^i{qi) ■ Xi{qi) dqi = - rMi) ■ Xi{qi) dqi + / [-^i{qi)] ' Xi{qi) dqi 

d Qi<Qr. d Qi<Qr. 


^(lr,<qi<qr. 


< - / q^i{qi) ■ Xiiqi) dqi + / [-q:>iiqi)] dqi 

dqi<qrj(i+l^) Jqr.<qi<qfi 
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and 


-L 




‘PMi) ■ Xi{qi) dqi 

-(1 + ^) / ‘fiiqi) ■ Xi{qi) dqi + (5 / ^i{qi) ■ Xi{qi) dqi 

J ao<arJ(l-\-0) <7z<Q'r^/(l+/5) 


qi<qrJ{l+(3) 
Qi 


<-/ + B 

lqi<qr, + P 


-^) dqi + P / Lpi{qi) • Xi{qi) dqi 

-‘-+P dqi<qr. 


L 

/ ^pi{qi) ■ Xi{--^) dqi+ 13 / (Pi{qi) dqi. 

d qi^qvA ' d 


Substituting in the expression for Shtf yields the result. □ 

The bound on Q is simply 

^ SRi = fe/3 • MR. (10) 

i 

In the following lemmas we bound the terms Q and Q. To bound Q we partition the integral 
into two intervals. The intervals are the ranges qr- < qi < max{(^^,g^.} and max{^^,g^.} < qi < 
qr -, respectively, where is the quantile of Fi corresponding to empirical quantile 


Lemma 6.6. Conditioned on S^, 


F [ [-+i{<li)] dqi < k^{l + u) ■ MR. 

■ dqr.<qi<ma,^{^i,qr-} 


Proof. If < qr- the integral is zero and the result is immediate. So we can assume that 
Ci ^ Qvi- Note that —(pi{qi) is a non-decreasing function of thus its smallest values in the 
range qi > qr- occur in the integral we are seeking to bound. It follows that 


/ 

^ Oi 


[-(fiiqi)] dqi < f [-^^{qi)] dqi 

qr^<qi<^i dq^.<q^ 

j dqi 

dqr, <qi 


— I dqi 

d qi<qr.l ^^<qi<^ 

= • SRi <^i’MR< C(1 + jy) • MR. 


(as / 
Jo< 


+i{qi) dqi = 0) 


The last two inequalities follow from Claim [6T[ ii) and Lemma 6.2, respectively. The result 
follows on summing over i. □ 


Lemma 6.7. Conditioned on 


E 


Yi / [-‘fiiQi)] dqi 

■ dma^{^i,qrj<qi<qr. 


< 


2u Y SRi < ‘^ku - MR. 


Proof, let Xi — Qn} and let Xi — be the corresponding empirical quantile. 

Again, if Xi ^ Qvi fhe integral is zero and the result is immediate. So we can assume that 


Xi < qfi‘ The derivation below uses Lemma 6.2 to justify the first and third inequalities; the 
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second inequality follows from the definition of ri as the empirical reserve price since 
Conditioned on 


Q.ri ’ i ^ 


Qfj • Tj ^ Xj-ViiXi) > Xi ■ Mxi) 

1 + u - 1 + 1 / “ (1 + 0 ^ ' 


Xi 


>l 

( 11 ) 


Thus 


L 


ma^{^i,qr-}<qi<qr- 


[-V>i{Qi)] dqi = Xi ■ ViiXi) - Qh ■ n < Xi ' Vi{Xi) 


1 - 


(1 + 0^ 


(by ([n])) 


< <2u-SRi<2u- MR. 

(1 + T^y 


□ 


It remains to bound term Q. 

Before proceeding to the next lemma we need some additional terminology, namely the 
notions of i-safety, and of large and small amounts. 

Definition 6.1. The vector of quantiles q — {qi^q2r '' 5 Qk) '1^ sci'ld to be i-safe if qh ^ ^ for all 

h y i. 

We will also write q = qj^ Q-ij) and q = q-i)^ when we want to focus on just two or 

one coordinates of the quantile vector. 

We define large and small amounts with respect to (pi{qi) and ^j{qj) as follows. 

Definition 6.2. Let p and p' he defined as in Lemma \6^ Suppose that (pi{qi) > (pj{qj). (pi{qi) 
is said to exceed ^i{qj) by a large amount in the following cases: 

L For qi > C(1 + /3); 

/X /X SR^ ^ u SR.- 

TiiQi) - TiiQj) ^ ^^ ^ 7 \ 

^ Qi ^ Qj 


a. and for qi < (^(1 + /3), 

/ \ / \ ^ ^ SR.- 

TiiQi) - TiKQj) > 

Otherwise, (pi{qi) is said to exceed ^i{qj) by a small amount. 

The following lemma bounds the probability that i wins by a large amount over j in the 
Myerson auction at quantile q, while j wins in the empirical auction at quantile q/(l + /3). 

Lemma 6.8. Conditioned on £}), for any pair i and j, and for any qi, if qj > then the 
probability of the following event is hounded by (1 + /3)^ — 1.- 


i wins in the Myerson auction by a large amount over j at quantile q, and j wins 
in the empirical auction at quantile q/(l + P), where q = (qi^q-i) is i-safe. 

Proof. We begin with the case that qi > (^(1 + A). Given £i), by Lemma [6^ for ^{1 + fd) < qi, 


SR. 


+i(®) < +i(Yf^) + 2f (l + /3)-^ andfor ^ < qj, ipj{qj{l +A){l + u)) < ^j{qj) + 2^^ 


Thus, 


+i(9i(l + ^)(1 + ^)) ^ ^jiqj) + 2 


A qj 


< +i(%) “ 

^ Qi 


< +i( 


Qi 


1 + /3' 
qj 


< Tii ) 


(from Definition Hl>) 
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Thus we have a lower bound of ipj{qj{l + A)(l + z/)) and an upper bound of on the 

remaining terms. Clearly these can both hold only for a limited range of qj and hence of 
which we bound as follows. Define qj = arginf^^.{(^j(gj(l + A)(l + v)) < ipi{qi/[l + /3])}. Then 
these bounds can hold at most for qj satisfying qj < qj < qj{l + A)(l + z/)(l + /3). To obtain a 
probability bound, one needs to express the range in terms of the qj quantile, namely ranging 
at most from |j/(l + z^) to min{l,|j(l + A)(l + z/)^(l + /3)}, i.e. with probability at most 
(1 + A)(l + z/)^(l + /3) — 1 = (1 + /3)^ — 1. 

When qi < ^(1 + /3), we proceed similarly. (The third inequality below follows because for 
% < C? the fourth inequality holds because < C by assumption.) 

u SR 

<fj{qj{l + A)(l + 0) < <fj{qj) + 2— ^ < (Pi{qi) 

LA qj 

< max{vi{qi),Vi{^i)} 

< (pi (———) < ipA —^—). 

The rest of the argument is as for (i). □ 


Lemma 6.9. Conditioned on S^, 

• J Qi<Qr, J- i“ P 


< 


{k - 1)^(1 + z/) + k{k - 1)[(1 + /3)^ - 1] + (p + p) + 4A;(1 + /3)(1 + z/)— • In 


1 + z/ 


MR. 


Proof. We let £f{qi) be the event that quantile q = (qi^q-i) is not i-safe. Clearly Pr[^^f] < 
'^j^i Cj ^ Let x^{qi) be the probability that i wins when the quantile q = q-i) 

is not i-safe. Note that Xi{qi \ q is not i-safe) < as having some qj be small only increases 

the probability that j wins. Thus 


xliqi) <{k- 1)^(1 + u)xi{qi). 


( 12 ) 


We let x\ denote the probability that i wins both in Myerson’s auction with i-safe q = 
{qi,q-i) and in the empirical auction at quantile q/(l + P). 

We also introduce notation to measure the probability of wins by small and large amounts, 
for i-safe quantiles. We will be measuring the probability that i wins in the Myerson auction 
at quantile q = q-ij) and j wins in the empirical auction at quantile q/(l + /3), for some 

q-ij> xfj{qi^qj) measures the probability of this event in the case that the win in the Myerson 
auction is by a small amount, and x\-[qi^qj) measures the probability of the event when the 
win margin is large. 

Switching perspectives, we let xfj{qi^qj) denote the probability that i wins in the empirical 
auction at quantile q/(l + /3) and j wins by a small amount over i in the Myerson auction at 
quantile q, where q = {qi,qj,q-ij) is j-safe. Clearly, x^--{qj,qi) = xfj{qi,qj). 

We also note that 


i{Qi) = xtiqi) + x\{qi) + / x^jqi, qj) + x\j{qi, qj) dqj, (13) 

j^i 


and for qi > 


1 + p 


dqj 


(14) 
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By Lemma 6.8 


/ . 

jQj>i 


_x\Mu(ij) < ( 1 + / 5 )^ - 1 - 


(15) 


We obtain: 


y] / ^Mi) ■ [xi{qi) - ^1(7-^)] dqi 
i P 

^ 4 (%) + (g*) + y / _x!j{qh qj) + 4 j{qi, qj) 

i Jqi<qn y y Jqj>i 

m 9i) I dq, 


dqj 


- Xi 


dqi (using (13) and ([M])) 
dqi dqj 


-y / _Xji{qj,qi) dqj dqi (16) 

j Jqj<qrj i^j Jqi>i 

< y / {k- 1)^(1 + 0 • ifiiqi) ■ Xi{qi) dqi + {k - 1)[(1 + ^)^ “ 1] / ¥^j(®) 

^ J(li<qrj^ Jqi<qrj^ 

(using (12) and ([T^) 


+ EE 

Z j/z L 


/ _7i(%) • xtjiqi, qj) dqi dqj 

d qi<qr^,qj>i 


- _^j{(lj)-Xji{(Ij >®) c?® dqj 

dqj<qrj:qj>^ 

<{k- 1)^“(1 + v)Y, MRi + {k- 1)[(1 + /3)2 - 1] ^ SRi 

i i 

+ J2J2 _[^i(®) • xtjiqi, qj) - ‘fjiqj) ■ x'^jiiqj, qi)] dqi dqj 

i j^iJqi<qri,qj<qrj,qj>i 

+ EE/ jPiiqi) ■ xlj{qi,qj) dqi dqj. 

i jjCi Jqi<qri,qj>qrj,qj>i 

We bound ((TtI)-® in turn. 

(k - 1)^(1 + j/) ^ MRi + (fc - 1) [(1 + /3)2 - 1] ^ SRi 

Z Z 

<{k- 1)^(1 + OMR + k{k - 1)[(1 + - 1]MR. 

We can deduce from that 

y / xlj{qi,qj) dqj < Xi{qi) < 1, 
j^i Jqj>l 


(17) 

(18) 
(19) 


( 20 ) 


( 21 ) 
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and from (14), for qj > (^, 




xtj{qi,qj) dqi = J2 / ^Mj^qi) dqi < < 1- 

Jqi>0 p 


( 22 ) 


For recall that xl-{qi,qj) = x^--{qj,qi). Thus when qj > if qi > ^(1 + /?), (pi{qi) 
AMi^qj) - ^Mj) ■ d^^jiiqj^qi) ^ (1 + A) ■ ^], and if qi < ^(1 + P), (Pi(qi) 


xlj{qi,qj) - ^Mj) ■ Xji{qj,qi) < Equivalently, if g* > ^(1 + /3), ipi{qi) ■ xlj{qi,qj) - 


-IJV 

SRn 


<lj 


Vj{qj) ■ x]i{qj,qi) < ""'l+y ^ [ 25 ^ + 2 f(l + p) ■ ^], and ifg* <^(1 + /?), 9i)- 

vMi) • ^M^qi) < ^3;^[2i^]- Thus 




2 ^ / A^Mi) ■ xtjiqi, qj) - ^jiqj) • x^jiiqj,qi)] dqi dqj 

i j^idqi<qri,qj<qrj,qj>(. 


< 


EE/ 
+EE/ 




dqi dqj 


Xij{qi,qj) _ 2 SRj 


1 + /9^ A Qj 


dqi dqj 


j^i ^qi^qvi:qi<^(^+(^):qj<qrj,qj 

<J2J2[ i +«- _^^A + P)A + i^)-^-xtj{qhqj) dqidqj 

i j^i dqi<qri,qi>j^^,qj<qrj,qj>^ ^ 


+ 


e/, 

• Jqr.>qi 


qr^>qj>Aji} ^ 


^ 1/ SR,(1 + Z^) , , . r^x 

2- - - - - dqj (using p^). 


qj 


(23) 


For (19), we note that as qj > qr-^ ^j{Qj) ^ 0? then when there is a small margin win 
by i, by definition, if ^(1 + P) < qu ipi{qi) < + 2^(1 + /3)^, and if ^(1 + P) > qu 


SR ■ 

^i{qi) < 2^-^. Also, the constraint qj > ^ implies qj > 


qi 


qj 


(1+^) 


Thus 


2 ^ 2 ^ / _q^i{qi) ■ xtjiqi, qj) dqi dqj 

i jjLi Jqi<qri,q3>qrj,qj>i 


sE/ , ,, 


,. cr) . 

2—(1 + u )—2 dqj (using @) 

A qj 


+ 


i j^i Jqj>qrj:qj>^:qi>^{^^^) 


V SR 

2^(1 + ^)(1 + 0—^ • xtjiqi, qj) dqi dqj (24) 
^ qi 
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Combining (|23[) and (24) yields 


J2J2 MiQi) ■ x!j{qi,qj) - (pjiqj) ■ x^ji{qj,qi)] dqi dqj 

i dqi<qri,qj<qrj,qjH 

+ EE , JPi{qi)xlj{qi,qj) dqi dqj 

I ^qi<qri,qj>qrj,qj>^ 


< 


■ J On 


z/ SR.- 

qi>Th ^ 


+ 


u. 


u SR 

a+B) ^ A + f3)il + u )—^ dqi (using 
qi>i^ ^ 


< y2(p+^/^)(i+i^)SRi In ■ ^ 


A 

3 

+ + z/)ln- 


e(l + /3) 




< 4fc£(l + /3)(1 + v) In i^MR. 


(25) 

□ 


We are now ready to bound Shtf. 

Lemma 6.10. 

Shtf < MR [k5 + k'^S + kp + fc^(l + u) + 2ku + {k - 1)^(1 + u) + k{k - 1)[(1 + pf - 1]] 
+ MR 


4fc(l +^)(1 + 


Proof. In the event that £a does not hold for some which occurs with probability at most 
k5^ the contribution to Shtf is at most 


k6 


/ <fiiqi)xi{qi) dqi- _ <fiiqi)xi{qi) 


dqi 


'qi<qr^ 

< MMR + My^ 


hr,<qi<qr 


qi<qri 

-ipiiqi)] dqi 


< kSMR + kS'^ ‘Piiqp dqi (as /^, (fiiqp dqi = 0) 

i Jqi<qri 

< kSMR + k5'^SRi< kSMR + fc^^MR. 


Otherwise, the contribution is given by summing the bounds from (10), Lemmas 6.6|-|6.7 
and 16.91 


□ 


Proof of Theorem \3.1\ We first choose A, z/, (^ < It is easy to check that then (1 + /3)^ — 1 = 
(1 + A)2(l + zy)6 - 1 < 2A(1 + A)(1 +1/)® + 6i/(l + A)2(l +1/)5 < (2A + Gn) (4|)^ < 4A + lli/. 
Similarly, P < 2A + 4r/, and 4(1 + P)(l + u) <4 (jf)^ < 4 • | = 6. Consequently, 


Shtf < MR 


k5 + k^5 + 2A:A + Aku + ?>k^ + 2A;z/ + A.k{k — 1)A + 11A:(A: — l)z/ + 6A:— In 


(1 + 0 
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It suffices that Shtf < cMR. To this end, we bound the right hand side of the above expression 
by e. To achieve this it suffices to choose z/, 5, and A as follows: 


k{k + l)5 
(4A:(fc - 1) + 2A:)A 
3A:C 

A.kv + 2kh' + 11A;(A: — l)v + In 

It suffices that 


5 = 

A = 


z/ = 



4^ 


1 

4^ 

1 

T' 


One final detail is that we need to set ^ also, but it suffices to note that ^ ^ 

{= e(f) + 


By Lemma 6.2, m — i2(^+ln ^ suffices. Recalling that 1 + v — (l+y)^, 


lU 

we obtain that m = In^ -) suffices. 


+ ^ and so 

SO 7 = 0(z/), 

□ 


7 Conclusions 

This paper proposes a general model for learning a near-optimal auction from data, in the form 
of i.i.d. samples from unknown distributions. It provides upper and lower bounds on the sample 
complexity for the case of single-item auctions, and shows that the number of samples required 
to obtain a (1 — e)-approximation of the optimal expected revenue scales polynomially with 
both the number k of bidders and with -. We conclude by listing some of the many directions 
in which this work could be extended. 

1. Prove tight bounds (in terms of k and on how many samples are necessary and sufficient 
to achieve expected revenue at least 1 — e times the maximum possible. (See [281 EH [K] 
for recent progress.) 

2. Prove good sample complexity upper bounds for settings other than single-item auctions. 
(See [32] for recent progress.) 

3. Are there natural settings where the learning problem is information-theoretically easy 
(meaning polynomial sample complexity) yet computationally hard (under complexity 
assumptions)? 

4. Identify multi-parameter problems, less general than those in m, where a near-optimal 
mechanism can be learned from a polynomial number of samples. 
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5. For problems where a (1 — e)-approximate mechanism cannot be learned from a poly¬ 
nomial number of samples, identify the best-possible approximation factor for which an 
approximately optimal mechanism can be efficiently learned. 

6 . If some of the bidders that contribute the samples are the same as the bidders that 
participate in the final auction, then these bidders need not bid truthfully. (Underbidding 
could result in lower payments in the future.) Is it still possible to learn a near-optimal 
auction in this setting? 
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